Historic,  archived  document 

Do  not  assume  content  reflects  current 
scientific  knowledge,  policies,  or  practices. 


United  States 
Department  of 
Agriculture 

Forest  Service 

Intermountain 
Research  Station 

Research  Paper 
INT-382 


A  Theoretical  Basis 
for  Modeling  Probability 
Distributions  of 
Fire  Behavior 


Ralph  A.  Wilson,  Jr. 




THE  AUTHOR 


RALPH  A.  WILSON  is  a  research  physicist  assigned  to 
the  Fire  Behavior  research  work  unit  at  the  Intermountain 
Fire Sciences Laboratory, Missoula, MT. His undergraduate
studies in mathematics and physics were at
Willamette  University,  Salem,  OR  (B.A.  1956),  and  his 
graduate  work  was  in  spectroscopy  at  University  of 
Oregon,  Eugene  (M.A.  1958).  He  worked  in  infrared 
detectors  and  optics  at  the  Naval  Ordnance  Test  Station 
(now  Naval  Weapons  Center),  China  Lake,  CA,  from  1957 
to  1964.  From  1964  to  1975,  he  worked  on  the  Forest 
Service's  airborne  infrared  forest  fire  surveillance  project 
at  the  Northern  Forest  Fire  Laboratory  (now  Intermountain 
Fire  Sciences  Laboratory). 

RESEARCH  SUMMARY 

A method for predicting a range or distribution of expected
fire behavior is developed to help fire managers
weigh options. A fire model of marginal burning states produces
"likelihood" distributions for the expected fire
states. These distributions depend only on the current site
observations and are independent of the prior fire state
history of the site of application.

Using  elementary  decision  theory,  a  fire  manager  first 
chooses  a  fire  state  prescription  for  the  site  to  be  burned. 
Then,  from  an  appropriate  decision  goal  and  the  prior 
distribution  of  the  prescribed  state,  a  threshold  value  of 
the  likelihood  for  that  fire  state  is  calculated  and  used  to 
decide  whether  the  site  conditions  are  within  the  burn 
prescription,  thence  whether  to  burn  or  not  burn. 

Two  types  of  error  rates  are  calculated  for  the  marginal 
burning  states:  the  probability  that  a  fire  might  burn  when 
predicted  to  not  burn  and  the  probability  that  it  might  not 
burn  when  predicted  to  burn.  A  parametric  plot  of  the  two 
error  rates  is  used  as  a  test  of  the  marginal  fire  state 
model  and  demonstrates  the  model's  utility. 


September  1987 

Intermountain  Research  Station 
324  25th  Street 
Ogden,  UT  84401 


A Theoretical Basis for Modeling
Probability  Distributions  of 
Fire  Behavior 

Ralph  A.  Wilson,  Jr. 


INTRODUCTION 

In recent years great progress has been made in applying
fire behavior models in fire management systems
nationwide and, in several cases, worldwide. Similarly, these
fire models have provided fire intelligence for an unanticipated
variety of administrative decision and planning
functions.

The wide acceptance of these models, and particularly the unique
situations to which they have been applied, have caused
problems in two interrelated areas: first, it has
been necessary to adjust and "fine-tune" fire behavior
systems to respond properly to different geographic locations,
and second, the resulting fire behavior information
is often insufficient, inappropriate, or misleading
in some special applications. Both problems would be
greatly reduced by developing a definitive method of estimating
uncertainties, calculating risks of error, or forecasting
a range of possible fire behavior.

This paper documents a theoretical basis for implementing
predictive distributions of fire behavior and is intended
primarily for the use of specialists who design wildland
fire models and systems for applying them. Although
Bayesian methods are used, the idea is not to specifically
recommend a Bayesian Decision System, but rather to
show how predictive distributions could be used by the
decision makers to specify the inherent limitations of
predictive systems, to suggest methods for system optimization
for specific applications, and to demonstrate
procedures for testing a model's performance. I have tried
to "keep it simple" in order to get basic ideas across;
thus, I ignore such niceties as distinguishing between continuous
and discrete distributions, rigorous use of probability
densities, etc., and I avoid complicated derivations.

The central theme of this discussion is that the fire
model must be universally applicable because fire is a
physical process governed by universal physical law;
however, a fire model is an approximation of real-world
fire behavior that implies a degree of uncertainty, an
uncertainty that must be understood by both the model
designer and the manager using the model. Although a
model has universal applicability, it is valid only within the
domain and range of variables for which it is designed,
tested, and verified.

In  addition,  it  is  emphasized  that  the  prior  historical 
frequency  of  any  predicted  fire  state  is  unique  to  each 
locality,  may  be  only  qualitatively  known  by  local  "expert" 
observers,  and  is  outside  the  predictive  capability  of  a 
physical  model  of  fire  behavior. 


The analysis and solution of these difficulties may be
enhanced by using what are sometimes called "Bayesian"
techniques, which are quite controversial yet
have some analytical merit. The controversy centers on
the admissibility of subjective or personal opinion for
estimating probabilities as a basis for decision making.
Very strong convictions are held on both sides of the controversy:
the modern Bayesian theorists emphasize the
subjective personal decision characterized by the possessive
nature of "his goal" and "his objective." In the
following development, reference to subjective estimation
is made where it may apply. The decision mechanics, risk,
and error analysis techniques are reliable and standard
engineering practice. These concepts will be illustrated
using the following example.

EXAMPLE  OF  FIRE  STATE  MODEL 
AND  OBSERVATIONS 

In field application, a fire model is combined with prior
knowledge of the local historical record to arrive at a
posterior estimate (prediction) of events. The model itself
requires observations of specific variables for the current
conditions. (The terms "prior" and "posterior," respectively,
refer to knowledge before and after the observations;
"observer" refers to the manager-decision maker.)

Models are commonly constructed on the basis of
posterior results of research experiments, but then
improperly represented as posterior field predictions.
Neglected is the fact that the prior conditions of the experiment
are different from those in the field application!
A properly designed model must express the likelihood
that the current situation would be observed if, indeed,
the given fire state would occur. When so designed, the
fire model is independent of the "prior" conditions of the
specific application and the experimental conditions from
which it was developed.

It has been demonstrated (Wilson 1985) that the
marginal limits of combustion may be represented to an
observer as probabilistic distributions of "fire states." The
distributions are functions of a parameter, n, which summarizes
in a single number all current fuel observations
and physically resembles a dimensionless burning rate; this
parameter is a deterministic function of the physical fuel
properties in a manner similar to traditional fire models:


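[Equation 1, which defines n as a ratio, is not legible in the source scan; the only recoverable fragment is a logarithmic term in the numerator beginning ln(σβδ ...).]     (1)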

where  the  nomenclature  is  similar  to  the  original 
Rothermel  (1972)  model  of  fire  spread: 

σ = fuel particle surface area to volume ratio

β = fuel bed packing ratio, solid fuel volume/total bed
volume

δ = fuel bed depth

h = heat of combustion of the fuel pyrolizate gases

[symbol illegible in source] = heat for pyrolysis (formerly heat of preignition) of
fuel

Q_w = heat of moisture desiccation (heat scaling factor).

The numerator of equation 1 represents the heat production
rate of the fire and the denominator represents the
energy needed to heat and pyrolize the fuel.

The burning state of a fire is only probabilistically determined
even under the most elaborate experimental controls.
Figure 1 shows the (posterior) experimental probability
distributions for fire states, b and b̄. For purposes
of this example, these fire states are defined by the fraction
of fireline aflame:

b1 is the state with fraction of fireline aflame larger
than zero

b̄1 is the state of fire that is out; b̄ is read "not b"

b2 is the state with fraction of fireline aflame larger
than 0.5

b̄2 is the state with fraction of fireline aflame less than
0.5

b3 is the state with fraction of fireline aflame larger
than 0.99

b̄3 is the state with fraction of fireline aflame less than
0.99.

In this example, the range of b is bounded with both upper
and lower limits: at the lower limit by b̄1, fires with
zero percent of fireline aflame, and at the upper limit by b3
with 99 percent aflame. By this mechanism we were able
to define and observe marginal burning.

In the discussion below, a fire state "window" is defined
by the conjunction of two elementary states. For example,
b_x = (b1, b̄2) is the state with fraction of fireline aflame greater
than zero and less than 0.5. These fire states are defined
in terms of an observable fire characteristic that might be
related to a "fire effect," treatment, or prescription such
as mineral soil exposure, fuel reduction, etc. Promising
alternative fire states could be defined by fire intensity,
flame length, or by spread rate itself, for example. The
fire state as defined above will serve for illustration.

The results of Wilson's (1985) experiment with almost
500 fires, as shown in figure 1, are potentially misleading
when applied to field problems. The results correctly show
the fire state probability distributions, P(b|n), for the
source experiment. But that experiment was designed to
examine fires near the limit of sustained combustion; thus,
38 percent of the test fires went out, P(b1) = 0.62,
52 percent burned with more than half of the fireline
aflame, P(b2) = 0.52, and 32 percent burned with a full
contiguous flame front, P(b3) = 0.32. A resource manager
is going to expect more than 62 percent of the brush fires
in southern California in August to be self-sustaining.

[Figure 1: two panels; horizontal axis: Extinction Index, n, from 0 to 14.]

Figure 1--The posterior conditional probability
distributions for the three marginal fire states
defined in the text (after Wilson 1985).

The marginal frequency distribution of fire states, P(b),
(namely, the "prior" probability of b) in Wilson's experiment
is unique to that experiment; that specific distribution
is a consequence of the experimental design. Thus,
the results plotted in figure 1 are "biased" by the design
of the experiment and different results should be expected
in other applications that will have different "prior" fire
state probability distributions.

In the following discussion, the experimental results
(fig. 1) are transformed into "likelihood ratios" (fig. 2) that
measure the strength of the observational evidence, n,
alone; this measure is independent of the prior probabilities
and thus is more universally applicable. The review of
decision theory and error analysis follows Egan (1975).
The illustrative example is the work of the author.


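[Figure 2: panel A shows the likelihood-ratio curves for the states b1, b2, and b3; panel B shows the curves for the complementary states. Horizontal axis: Extinction Index, n, from 0 to 14. Panel A marks the threshold L(n_0) = 2.0 at n = 5.15.]

Figure 2--The likelihood ratios for the marginal
fire states.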


PROBABILITIES,  DECISIONS,  AND 
RISK 

Let us suppose a fire manager must decide "Do I try
this prescribed burn today?" The objective of the burn
might be to expose mineral soil and reduce fuels. This
treatment requires the fire state to burn within specific
limits defined by the fire prescription. The manager must
decide "yes" (to burn) or "no" (not to burn). In either
case, there are two possible events (outcomes or results),
depending on fuels and environmental conditions: (1) the
fire might burn in a state, b, to accomplish his management
goal or (2) it might not burn, b̄, in that state.

Many decision problems can be reduced (or combined) to
just two responses (decision alternatives) and two events
(or outcomes). The manager's task is to decide, Y or N, on
the basis of uncertain evidence, n, whether his objective
will obtain, b, or not obtain, b̄. Thus, there are four
possible outcomes: the conjunction (Y,b) is a "correct acceptance"
or a "hit" and its complement (N,b) is a "false
rejection" or a "miss"; the conjunction (N,b̄) is a "correct
rejection" and its complement (Y,b̄) is a "false acceptance"
or a "false alarm." In decision theory, the two conjunctions
(N,b) and (Y,b̄) are called errors of the first kind
and errors of the second kind, respectively; the first is a
missed detection (rejecting a true hypothesis), the second
is a false alarm (accepting a false hypothesis) (see table 1).

His  knowledge  being  imperfect,  the  manager's  goal  is  to 
make  a  decision  with  the  highest  probability  of  achieving 
his  objective.  The  manager  cannot  be  criticized  for  a 
"bad"  result  of  such  rational  decision.  In  the  real  world  of 
imperfect  knowledge,  a  non-zero  probability  exists  that  a 
good  decision  may  have  a  bad  result  and,  conversely,  a 
bad  decision  may  have  a  good  result. 

The performance of an observer (the fire manager) in a
single decision trial is assessed in terms of the probabilities
of a correct decision, P(C), and of an error, P(E):

$$P(C) = P(Y,b) + P(N,\bar{b})$$

$$P(E) = P(Y,\bar{b}) + P(N,b)$$

where, for example, P(Y,b) is called the joint probability,
that is, the probability that the event, b, and the response,
Y, both occur on the same trial.

Objective,  Goal,  Function,  Rule 

We now divide the decision process into a logical
sequence: first, the manager-observer predetermines an
explicit management objective and a specific decision
goal; from the goal a decision function is defined, whose
value depends on the decision of the observer; in turn, a
decision rule is determined that maximizes (or minimizes)
the decision function. For example,

—  the  objective  might  be  to  burn  a  fire  in  the  state,  b, 

—  the  goal  might  be  to  have  the  largest  probability  of  a 
correct  decision, 

—  then  the  decision  function  is  P(C)  as  shown  above, 

—  the  decision  rule  is  then,  to  "say  YES"  if  the  post- 
erior probabilities  (determined  by  the  observations  and 
prior  probabilities)  favor  maximizing  the  decision  function, 
maxP(C). 

Examples  of  classical  decision  goals  include: 

1.  The  resource  manager  may  wish  to  simply  maximize 
the  probability  of  a  correct  decision,  maxP(C),  as  indicated 
above. 

2.  In  cases  with  significant  values  at  risk,  the  manager 
may  wish  to  maximize  the  expected  value  (or  minimize 
risk). 

Table 1--Event-response conjunctions, A, and conditional
probabilities, B

A. Event-response conjunctions

Event    Response Y               Response N
b        (Y,b)  "hit"             (N,b)  "miss"
b̄        (Y,b̄)  "false alarm"     (N,b̄)  "correct reject"

B. Conditional probabilities

Event    Response Y                     Response N
b        P(Y|b)  "hit rate"             P(N|b)  "miss rate"
b̄        P(Y|b̄)  "false alarm rate"     P(N|b̄)  "reject rate"


3. An observer may wish to minimize the maximum
probable error rate, the "minimax observer"; this case is
of particular interest when the maxP(C) observer has
unknown (or very uncertain) prior probabilities.

4. Other examples of decision goal optimization are contained
in the literature; most are variations of these examples.
(A fire department might fix a given, tolerable
false alarm rate and then work to minimize the rate of
missed fire detections; this is the goal of the classical
Neyman-Pearson observer, for example.) We will confine
our discussion to the above three examples for illustration.

In classical discussion, probabilities are interpreted in a
frequency sense; in an indefinite number of observations,
an event could be expected to occur in some explicit fraction
of the repetitions. But probability is commonly used
in another, subjective sense (Larson 1969); for example,
"The probability is 10 percent that it will rain tomorrow,"
or "The chances are 8 to 2 that the fire intensity will be
between 3,000 and 5,000 Btu/s-ft." These latter examples
of subjective probabilities measure an individual's "degree
of belief" in a proposition (Savage 1954). That expert opinion
(the result of experience) should be expressed in
terms of subjective (or personal) probabilities is an implicit
tenet of the Bayesian approach to decision theory (Slovic
and Lichtenstein 1971).

The so-called prior probabilities, P(b) and P(b̄), concern
the relative frequency of occurrence of the respective
events and are usually estimated from historical data. The
prior historical occurrence of any distribution of fire states
must certainly vary from locality to locality and the effect
is neglected in current application. Data sources (other
than personal experience) might include documented fire
histories and climatologic records, for example. Some
authors emphasize the admissibility of uncertainty and
"subjective degrees of belief" in the prior probabilities and
advise the choice of prior probabilities with large variance
if one is less certain. One should always use the best information
available. In a specific decision context, prior probability
data are assumed to be fixed (constant and unalterable
for that decision) and the occurrence, b or b̄, on any
given trial is independent of what may have happened on
any previous trial.

In addition to prior knowledge, the rational manager
also makes observations of the current situation and incorporates
this (imperfect) knowledge into his decision. The
observer first estimates the likelihood for good or bad outcome
based on skillful evaluation of the observations, n,
alone. The observations, n, (in future models) might include
an exhaustive list of all known physical variables
affecting fire behavior such as fuels, topography, and
weather; of course, any prediction, Y or N, will still have
some chance of being incorrect. In this process, the rational
decision maker uses a fire model, either systematically,
as in equation 1, or intuitively, to calculate the
likelihood rates at which the observations alone favor each
of the four event/response conjunctions (table 1). Then the
decision is made by comparing the possible outcomes with
the prior odds of the current application (if they are
known).

The "hit" rate, P(Y|b), is the conditional probability of a
response "yes" given that the event, b, would occur; and
similarly for the other conditional probabilities (table 1).
Note that P(Y,b) is the probability of a "hit" and P(Y|b)
is the "hit rate." The joint probabilities of each possible
outcome are as follows:

$$
\begin{aligned}
P(Y,b) &= P(Y \mid b)\,P(b) && \text{correct acceptance, "hit"} \\
P(N,b) &= P(N \mid b)\,P(b) && \text{incorrect rejection, "miss"} \\
P(N,\bar{b}) &= P(N \mid \bar{b})\,P(\bar{b}) && \text{correct rejection, "reject"} \\
P(Y,\bar{b}) &= P(Y \mid \bar{b})\,P(\bar{b}) && \text{incorrect acceptance, "false alarm"}
\end{aligned}
$$

and

$$P(C) = P(Y \mid b)\,P(b) + P(N \mid \bar{b})\,P(\bar{b}) \tag{2}$$

$$P(E) = P(N \mid b)\,P(b) + P(Y \mid \bar{b})\,P(\bar{b}). \tag{3}$$

It also follows that:

$$P(Y,b) + P(N,b) + P(Y,\bar{b}) + P(N,\bar{b}) = 1.0 \tag{4}$$

$$P(b) + P(\bar{b}) = 1.0 \tag{5}$$

$$P(Y \mid b) + P(N \mid b) = 1.0 \tag{6}$$

$$P(Y \mid \bar{b}) + P(N \mid \bar{b}) = 1.0. \tag{7}$$


The foregoing equations are a result of how the decision
situation is set up and the algebra of probabilities. For example,
equation 5 says (or implies) that the fire will accomplish
the objective, b, or it will not; there is no middle
ground or alternative. Equations 6 and 7 say that I must
make a decision; for example, if I refuse to make a decision
(after I have accepted the objective), then one of
either Y or N must occur by default. Equation 4 says that
one of the results (event/responses) of table 1 must occur,
and taken with equations 2 and 3, my decision will be
right or wrong!
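
As a minimal sketch of the bookkeeping in equations 2 through 7, the joint probabilities of table 1 and the resulting P(C) and P(E) might be computed as follows. The function names, the label "not_b" for the complementary event, and the numerical rates are hypothetical illustrations and are not taken from the experiment.

```python
def joint_probabilities(hit_rate, false_alarm_rate, prior_b):
    """Return the four joint probabilities of table 1.

    hit_rate         -- P(Y|b), assumed known for this illustration
    false_alarm_rate -- P(Y|not b), assumed known for this illustration
    prior_b          -- P(b), the prior probability of the event b
    """
    for value in (hit_rate, false_alarm_rate, prior_b):
        if not 0.0 <= value <= 1.0:
            raise ValueError("probabilities must lie in [0, 1]")
    prior_not_b = 1.0 - prior_b            # equation 5
    miss_rate = 1.0 - hit_rate             # equation 6
    reject_rate = 1.0 - false_alarm_rate   # equation 7
    return {
        ("Y", "b"): hit_rate * prior_b,                  # "hit"
        ("N", "b"): miss_rate * prior_b,                 # "miss"
        ("Y", "not_b"): false_alarm_rate * prior_not_b,  # "false alarm"
        ("N", "not_b"): reject_rate * prior_not_b,       # "correct reject"
    }


def correct_and_error(joint):
    """Return (P(C), P(E)) as in equations 2 and 3."""
    p_correct = joint[("Y", "b")] + joint[("N", "not_b")]
    p_error = joint[("N", "b")] + joint[("Y", "not_b")]
    return p_correct, p_error


# Hypothetical rates chosen only to exercise the arithmetic.
joint = joint_probabilities(hit_rate=0.8, false_alarm_rate=0.1, prior_b=0.6)
p_correct, p_error = correct_and_error(joint)
print(joint)
print(p_correct, p_error)
```

Because the four conjunctions are exhaustive (equation 4), P(C) and P(E) computed this way always sum to 1.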

In the interest of generality, we note that most decision
problems can be reduced to the two-event, two-response
situation. To illustrate, a certain fire treatment may require
a residence time, b_1, flame lengths not to exceed b_2,
etc., ..., b_j; this series of specifications may be summarized
in a single prescription, b_x, which is the simple conjunction
of the individual requirements:

$$b_x = (b_1, b_2, \ldots, b_j)$$

Similarly, observational data on fuels, n_f, meteorology, n_m,
and topography, n_t, will be represented by the
conjunction:

$$n_x = (n_f, n_m, n_t, \ldots)$$

and furthermore, the joint probability that b_x and n_x occur
is given by

$$P(b_x, n_x) = P(b_1, b_2, b_3, \ldots, b_j \mid n_f, n_m, n_t, \ldots)\,P(n_f, n_m, n_t, \ldots) = P(b_x \mid n_x)\,P(n_x).$$

The premise of the formal decision procedure is that the
vector, b_x, contains sufficient and necessary information
to best define the management objective and that the vector,
n_x, be a proper decision discriminator.

The so-called posterior conditional probability, P(b_x|n_x)
(hereafter written P(b|n)), is the single most important
probability for the decision maker. Many "old firehorses"
are respected because of their skill at estimating prior
probability (from previous experience) combined with
estimating (most often by subjective intuition) the probability
that a result will occur, given the current observations,
and also for the courage to act despite risk. The
following development describes a procedure to systematically
optimize this decision process.




The Bayesian model (variously called Bayes rule,
theorem or function, after the Reverend Thomas Bayes,
ca. 1763) states that

$$P(b_i \mid n) = \frac{P(n \mid b_i)\,P(b_i)}{\sum_j P(n \mid b_j)\,P(b_j)} = \frac{P(n \mid b_i)\,P(b_i)}{P(n)} \tag{8}$$

provided that the b_i are mutually exclusive and exhaustive.
P(b_i|n) is the posterior probability that b_i is true, taking
into account the new data, n, and the prior probability,
P(b_i); that is, P(b_i|n) is a function of b_i with the "given"
parameter set to n. P(n|b_i) is the conditional probability
that n would be observed when we assume b_i is known;
that is, P(n|b_i) is a function of n with the "given" parameter
set to b_i. (Because n is a continuous function of the
fuel bed parameters, these probabilities are known rigorously
as probability densities. This distinction is not
germane to this discussion, so we will maintain the notation,
P, etc., for simplicity. Bayes theorem is succinctly
discussed by Hahn and Shapiro [1967].) This latter distribution,
P(n|b_i), thus models the current physical conditions
independent of the prior occurrence of b_i, and should
be more universally applicable. P(n|b_i) is determined from
previous observational experience or from independent experimental
research as demonstrated in the example. It is
sufficient here to note the following identities:

$$P(n,b) + P(n,\bar{b}) = P(n) \tag{9}$$

$$P(b \mid n) + P(\bar{b} \mid n) = 1.0 \tag{10}$$

or

$$P(\bar{b} \mid n) = 1 - P(b \mid n), \text{ etc.} \tag{11}$$
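
A short sketch may help fix the mechanics of the Bayesian model for a set of mutually exclusive and exhaustive states: each posterior is the likelihood times the prior, divided by the sum of those products over all states. The function name and the numerical likelihoods below are hypothetical and serve only to illustrate the calculation.

```python
def posterior(likelihoods, priors):
    """Posterior probabilities P(b_i|n) for mutually exclusive, exhaustive states.

    likelihoods -- P(n|b_i) for each state, evaluated at the observed n
    priors      -- P(b_i) for each state; must sum to 1
    """
    if len(likelihoods) != len(priors):
        raise ValueError("need one likelihood per prior")
    if abs(sum(priors) - 1.0) > 1e-9:
        raise ValueError("priors must sum to 1 (exhaustive states)")
    weighted = [l * p for l, p in zip(likelihoods, priors)]
    evidence = sum(weighted)  # P(n), the sum of the joint probabilities
    if evidence <= 0.0:
        raise ValueError("the observation has zero probability under every state")
    return [w / evidence for w in weighted]


# Two states, b and not-b, with hypothetical likelihoods of the observation n.
post_b, post_not_b = posterior(likelihoods=[0.30, 0.10], priors=[1 / 3, 2 / 3])
print(post_b, post_not_b)
```

With only two states the second posterior is simply one minus the first, which is the content of the complement identity for P(b|n).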

It is convenient to form the ratio of equation 8 with
respect to the two states b and b̄ (in order that the
dependence on the frequency distribution of observations,
P(n), be removed); b and b̄ are two arbitrary, mutually
exclusive and exhaustive fire states, namely, those that
will burn within some fire prescription and those that will
not, respectively. Then

$$\frac{P(b \mid n)}{P(\bar{b} \mid n)} = \frac{P(n \mid b)}{P(n \mid \bar{b})} \cdot \frac{P(b)}{P(\bar{b})} \tag{12}$$


The left-hand side of equation 12 is the ratio of posterior
probabilities (conditional on the observations, n) and gives
the odds that the fire will burn in the state b. Henceforth,
"odds" will specifically mean a ratio of probabilities
whereby odds may assume values greater than 1.

The right-hand side of equation 12 shows explicitly that
two types of information enter into the calculation of the
posterior odds. The second term on the right, the ratio of
prior probabilities, summarizes the observer's imperfect
knowledge about the relative frequency of occurrence
(odds of b vs. b̄) before the evidence, n, was observed, as
discussed above.

The first ratio on the right of equation 12 is

$$L(n) = \frac{P(n \mid b)}{P(n \mid \bar{b})} \tag{13}$$

L(n) is called the likelihood ratio and gives the likelihood
(odds) of observing the evidence, n, on a b trial
relative to observing the same evidence on a b̄ trial; L(n)
contains exclusively the information provided by the
universal fire model. It is implicit that the evidence, n, is
"good" data and that it is appropriate to the decision
process to the degree that it makes the likelihood ratio a
good decision discriminator. In the example (fig. 2A),
Wilson's experimental results are transformed to remove
the dependence on his prior experimental design and are
presented as likelihood ratios; the likelihood ratios for the
b̄ states shown in figure 2B are the respective inverses of
the b state ratios as a consequence of equation 13. The
functions L(n) for the individual fire states are shown to
be monotonic continuous functions of the parameter, n,
and thus suitable discriminators for decision criteria. Note
again that the prior odds of achieving a certain fire state
(or fire prescription) will change from place to place (and
time to time), but lack the benefit of specific information,
n, while the likelihood ratio responds only to the current
physical conditions, n, in the same manner for any given
place or time.
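
The transformation from an experiment's posterior probabilities to likelihood ratios, and back to a posterior for a different locality, can be sketched as below. The sketch assumes only that posterior odds equal the likelihood ratio times the prior odds; the function names and the value 0.70 read from a probability curve are hypothetical, while 0.52 is the experimental P(b2) quoted in the text.

```python
def likelihood_ratio(posterior_b, prior_b):
    """L(n) = posterior odds of b divided by prior odds of b."""
    for value in (posterior_b, prior_b):
        if not 0.0 < value < 1.0:
            raise ValueError("probabilities must lie strictly between 0 and 1")
    posterior_odds = posterior_b / (1.0 - posterior_b)
    prior_odds = prior_b / (1.0 - prior_b)
    return posterior_odds / prior_odds


def posterior_from_likelihood(ratio, prior_b):
    """Posterior P(b|n) for a site whose prior probability of b is prior_b."""
    if not 0.0 < prior_b < 1.0:
        raise ValueError("prior must lie strictly between 0 and 1")
    odds = ratio * prior_b / (1.0 - prior_b)
    return odds / (1.0 + odds)


EXPERIMENT_PRIOR_B2 = 0.52       # P(b2) in the source experiment (from the text)
posterior_in_experiment = 0.70   # hypothetical reading of P(b2|n) at some n

ratio = likelihood_ratio(posterior_in_experiment, EXPERIMENT_PRIOR_B2)
# The same evidence applied where b2 is historically less common (prior 1/3).
field_posterior = posterior_from_likelihood(ratio, 1 / 3)
print(ratio, field_posterior)
```

The same likelihood ratio yields a smaller posterior where the state is historically rarer, which is why the ratio, rather than the experimental posterior, is the portable quantity.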

The  MaxP(C)  Observer 

The objective is to burn a fire in the state b. The goal of
the maxP(C) observer is to have the maximum probability
of a correct decision; cost and reward values are not involved.
But the probability estimate of a correct decision
depends on the current observations of the physical fire
environment and the prior frequency distribution of the
fire states in the given local situation.

As already shown, most basic decisions may be reduced
to just two events, b and b̄, and two responses, Y and N,
with the resulting four event-response conjunctions. On
each trial, the observer's decision is based on the specific
evidence, n. We assume that the observer has calculated
(subjectively or otherwise) the prior probabilities and the
likelihood ratio of the evidence, n. Thus, the maximum
P(C) observer knows the posterior odds, P(b|n)/P(b̄|n), of
the event, b, etc.

With reference to table 1, clearly the "correct" conjunctions
(Y,b) and (N,b̄) are preferred by the maximum P(C)
observer; that is, we wish to maximize the decision function
(equation 2):

$$P(C) = P(Y \mid b)\,P(b) + P(N \mid \bar{b})\,P(\bar{b}).$$

Following the development above, this observer's decision
rule is to say "yes" if the posterior odds favor the event
b; otherwise, say "no." Thus, he should say "yes" if

$$\frac{P(b \mid n)}{P(\bar{b} \mid n)} > 1$$

or by rearrangement, using equations 12 and 13, we have
the MaxP(C) decision rule:

$$\text{if } L(n) > \frac{P(\bar{b})}{P(b)}, \text{ say "yes," otherwise say "no."}$$

No larger P(C) can be realized by an observer who adopts
any other decision rule. This criterion also minimizes the
error probability of equation 3.

Suppose a manager wishes to burn a fire in the state b2
and in the present local situation (known to be poor burning
conditions by prior experience), the prior odds suggest
that b2 is unlikely, say P(b2) = 1/3, implying a threshold
value of L(n):

$$L(n_0) = \frac{P(\bar{b}_2)}{P(b_2)} = \frac{2/3}{1/3} = 2.0$$

as shown in figure 2A. Then, if the observation, n (based
on fuel moisture or fuel loading), exceeds n_0 = 5.15, he
proceeds with the prescribed burn. Note that if b2 is
more likely, that is, P(b2) is larger, the threshold, L(n_0), is smaller and he may try
the burn at lower n, that is, higher fuel moisture or
lighter fuel load; and conversely, for more unlikely b2.
Thus, a rational format is suggested whereby a universally
applicable fire model is appropriately adaptable to any
local situation!
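
As an illustration of the MaxP(C) rule, the threshold on the likelihood ratio is the prior odds against the prescribed state, and the response is "yes" only when the observed likelihood ratio exceeds it. The sketch below is a hedged example: the function names and the trial likelihood ratios 2.5 and 1.5 are hypothetical, while the prior of 1/3 is the one used in the example above.

```python
def maxpc_threshold(prior_b):
    """Threshold L(n_0) = P(not b) / P(b) for the MaxP(C) observer."""
    if not 0.0 < prior_b < 1.0:
        raise ValueError("prior must lie strictly between 0 and 1")
    return (1.0 - prior_b) / prior_b


def maxpc_decision(likelihood_ratio, prior_b):
    """Say "yes" if the likelihood ratio exceeds the prior odds against b."""
    return "yes" if likelihood_ratio > maxpc_threshold(prior_b) else "no"


threshold = maxpc_threshold(1 / 3)   # prior P(b2) = 1/3 from the example
print(threshold)
print(maxpc_decision(2.5, 1 / 3))    # hypothetical observation favoring a burn
print(maxpc_decision(1.5, 1 / 3))    # hypothetical observation below threshold
```

A larger prior lowers the threshold, so the same fire model admits a burn at a lower n where the prescribed state is historically more common.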

The  Expected  Value  Observer 

The manager faced with a decision involving major
values at risk (the risk of large costs or potentially large
benefits) will have at hand all relevant information to
estimate a payoff matrix associated with the net value of
each conjunction as shown in table 2. This observer also
has information to estimate the posterior odds, etc. We
construct two decision functions dependent on the evidence,
n, and the respective responses, "yes" and "no":

1. The expected value, E(V|n,Y), if the observer says
"yes," and

2. The expected value, E(V|n,N), if the observer says
"no."

The  two  values  of  the  payoff  matrix  associated  with  the 
"yes"  response  are  weighted  by  the  appropriate  posterior 
probability  rate 

E{V\n,Y)  =  P(6|n)FY,6  +  P(^|n)FY,^.  (14) 

Similarly,  for  the  response  "no" 

E{V\n,^)  =  P{b\n)V^,  +  P{]l>\n)V^_^.  (15) 

Then,  clearly,  the  observer  should  say  "yes"  if  the 
expected  value  of  the  "yes"  response  is  greater  than  the 
expected  value  of  the  "no"  response,  that  is, 

if  £'(F|n,Y)  >  E{V\n,'N)  say  "yes,"  otherwise  say 
"no." 

From  equations  14  and  15,  this  rule  is  expressed  in  terms 
of  the  posterior  odds: 


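If

$$\frac{P(b \mid n)}{P(\bar{b} \mid n)} > \frac{V_{N,\bar{b}} - V_{Y,\bar{b}}}{V_{Y,b} - V_{N,b}},$$

say "yes," otherwise say "no,"

or in terms of the likelihood ratio and prior odds, the
expected value decision rule becomes

If

$$L(n) > L(n_0) = \frac{P(\bar{b})}{P(b)} \cdot \frac{V_{N,\bar{b}} - V_{Y,\bar{b}}}{V_{Y,b} - V_{N,b}}, \tag{16}$$

say "yes," otherwise say "no."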

Table 2—Payoff matrix of values for event/response conjunctions

Event        Response Y                      Response N
b            Y, b: value of success          N, b: cost, missed opportunity
$\bar{b}$    Y, $\bar{b}$: cost of failure   N, $\bar{b}$: savings for not trying

Note that if the payoff matrix has equal values for the respective b
and $\bar{b}$ events, then the maximum expected value observer obtains
the same result as the maxP(C) observer. The second ratio on the
right-hand side of equation 16 contains exclusively the values and costs
involved in the decision. The numerator expresses the "yes-no" value
difference (the value of being correct minus the value of error) if
$\bar{b}$ prevails; the denominator similarly if b prevails. The value
ratio is an expression of the relative importance of the two events,
$\bar{b}$ and b. In many cases, where explicit values are lacking, the
manager assigns a numerical factor that expresses his rational judgment
of the relative importance of the two events. Many rational decisions
are made to proceed with a task, often against large odds, simply
because the manager needs (values?) to get the job done. An example for
a specific prescription window is illustrated below.
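
As a rough sketch of equations 14 to 16 (separate from the paper's prescription-window example, which comes later in the text), the Python below computes both expected values and checks that comparing them gives the same answer as comparing the posterior odds with the value ratio. The payoff numbers and the names `expected_values` and `odds_threshold` are hypothetical, and the sketch assumes $V_{Y,b} > V_{N,b}$ so that the value ratio has a positive denominator.

```python
def expected_values(post_b, payoff):
    """Equations 14 and 15: expected value of each response given P(b|n)."""
    post_not_b = 1.0 - post_b
    ev_yes = post_b * payoff["Y,b"] + post_not_b * payoff["Y,not_b"]
    ev_no = post_b * payoff["N,b"] + post_not_b * payoff["N,not_b"]
    return ev_yes, ev_no

def odds_threshold(payoff):
    """Value ratio that the posterior odds must exceed to say "yes"."""
    numerator = payoff["N,not_b"] - payoff["Y,not_b"]
    denominator = payoff["Y,b"] - payoff["N,b"]  # assumed positive
    return numerator / denominator

# Hypothetical payoff matrix, chosen only for illustration.
payoff = {"Y,b": 8.0, "N,b": -2.0, "Y,not_b": -6.0, "N,not_b": -1.0}

for post_b in (0.1, 0.3, 0.4, 0.9):
    ev_yes, ev_no = expected_values(post_b, payoff)
    posterior_odds = post_b / (1.0 - post_b)
    print(post_b, ev_yes > ev_no, posterior_odds > odds_threshold(payoff))
```

For every posterior probability tried, the two comparisons agree, which is the content of the rearrangement from equations 14 and 15 to the posterior-odds form of the rule.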

Decisions  Without  Prior  Odds  and  ROC 
Curves 

The observer may use a threshold value of the likelihood ratio as the
basis of his decision, even though the prior odds are unknown, thus
basing the decision on the observations alone (this is the same as
assuming, for example, that $P(b) = P(\bar{b})$). (The detractors of the
modern Bayesian theorists argue rhetorically that if the "prior"
probability distributions are unknown, then this assumption is
equivalent to "the equal distribution of ignorance.") Or, as in figure
2A, he may simply want the likelihood ratio of the observations to favor
b over $\bar{b}$ by 2:1 odds, etc. A given threshold, $L(n_0)$, implies a
specific cutoff value, $n_0$, as the equivalent basis of the decision;
namely, a decision rule (defined below) may require "say yes" if
$n > n_0$, otherwise "say no." Furthermore, the probability
distributions, $P(n \mid b)$ and $P(n \mid \bar{b})$, are assumed to be
known as in the illustrative example. Then we can define the "miss rate"
and "false alarm rate," respectively, as:

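the miss rate,

$$P(N \mid b) = P(n < n_0 \mid b) = \int_{-\infty}^{n_0} P(n \mid b)\, dn = \int_{-\infty}^{n_0} \frac{P(b \mid n)\, P(n)}{P(b)}\, dn \tag{17}$$

$$P(N \mid b) = 1 - \int_{n_0}^{\infty} P(n \mid b)\, dn, \tag{17A}$$

and false alarms,

$$P(Y \mid \bar{b}) = P(n > n_0 \mid \bar{b}) = \int_{n_0}^{\infty} P(n \mid \bar{b})\, dn = \int_{n_0}^{\infty} \frac{P(\bar{b} \mid n)\, P(n)}{P(\bar{b})}\, dn. \tag{18}$$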

Thus, the mechanics for calculating the error rates associated with
the observational model are specified. Note also that the other two
conditional probabilities for the truth table (table 1B) are determined
by use of equations 6 and 7. Then the two error rates are plotted
parametrically over the domain of $n_0$ ($-\infty < n_0 < \infty$) as in
figure 3B. (Such plots are called receiver operating characteristic
(ROC) curves and were first developed by electronic engineers working on
search and detection systems during the 1940's and 1950's, thus
accounting for much of the terminology: "false alarm," "missed
detection," etc.) The "miss" error conditional probability,
$P(N \mid b)$, is plotted on the ordinate and the "false alarm,"
$P(Y \mid \bar{b})$, is plotted as the abscissa. The operating curve thus
generated is characteristic of the observation system, namely, it is
dependent only on the distribution, $P(n \mid b)$; note that these error
rates are independent of the prior probabilities because we assumed a
decision rule based only on a critical value of the observations, $n_0$.

It is instructive to evaluate the derivative of the miss rate with
respect to the false alarm rate at the cutoff, $n_0$; from equations 17A
and 18, we have:

$$\left. \frac{dP(N \mid b)}{dP(Y \mid \bar{b})} \right|_{n_0} = \frac{-P(n_0 \mid b)}{P(n_0 \mid \bar{b})}. \tag{19}$$


Thus, the slope at a point on the error operating curve is equal to
the negative of the likelihood ratio, $L(n)$, for the n value associated
with that point, recognizing that $n_0$ is simply a parameter of the
foregoing construction. Any decision maker who chooses an operating
point on the system's ROC curve is called a "Likelihood Ratio Observer"
irrespective of which decision goal he uses to fix the operating point,
$L(n_0)$, on the ROC curve.
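
The construction of the error operating curve and its slope can be checked numerically. The sketch below is an assumption-laden illustration, not the paper's fitted model: it takes $P(n \mid b)$ and $P(n \mid \bar{b})$ to be normal distributions with hypothetical means and a common hypothetical standard deviation, computes the miss and false alarm rates from the cutoff $n_0$, and compares a finite-difference slope with $-L(n_0)$.

```python
import math

# Hypothetical distributions of the extinction index n (not the paper's
# fitted curves): n | b ~ Normal(7, 2) and n | not-b ~ Normal(3, 2).
MU_B, MU_NOT_B, SIGMA = 7.0, 3.0, 2.0

def normal_pdf(x, mu, sigma):
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def normal_cdf(x, mu, sigma):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def miss_rate(n0):
    """P(N|b) = P(n < n0 | b)."""
    return normal_cdf(n0, MU_B, SIGMA)

def false_alarm_rate(n0):
    """P(Y|not-b) = P(n > n0 | not-b)."""
    return 1.0 - normal_cdf(n0, MU_NOT_B, SIGMA)

def likelihood_ratio(n):
    """L(n) = P(n|b) / P(n|not-b)."""
    return normal_pdf(n, MU_B, SIGMA) / normal_pdf(n, MU_NOT_B, SIGMA)

def roc_slope(n0, h=1e-4):
    """Central-difference estimate of dP(N|b)/dP(Y|not-b) at the cutoff n0."""
    d_miss = miss_rate(n0 + h) - miss_rate(n0 - h)
    d_false_alarm = false_alarm_rate(n0 + h) - false_alarm_rate(n0 - h)
    return d_miss / d_false_alarm

# Trace the curve parametrically in n0 and compare the slope with -L(n0).
for n0 in (3.0, 4.0, 5.0, 6.0, 7.0):
    print(n0, false_alarm_rate(n0), miss_rate(n0),
          roc_slope(n0), -likelihood_ratio(n0))
```

Under these assumed distributions the last two printed columns agree to several decimal places at every cutoff, which is the slope relation stated in the text.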

Let us consider, momentarily, the observer whose goal is simply that
the hit rate should equal the false alarm rate, that is, he wants to
pick a Y-N cutoff, $n_0$, such that when he says "yes" he has a
successful burn, b, at least as often as he says "yes" and fails,
$\bar{b}$. The limiting worst case,

$$P(Y \mid b) = P(Y \mid \bar{b}),$$

is shown in figure 3A as the straight line diagonal between the points
(0,1) and (1,0). This observer is primarily concerned with the results
when his answer is "yes," so consider the sum of joint probabilities:

$$P(Y) = P(Y, b) + P(Y, \bar{b}) = P(Y \mid b)\, P(b) + P(Y \mid \bar{b})\, P(\bar{b}).$$

Impose the limiting condition, $P(Y \mid b) = P(Y \mid \bar{b})$; then
because $P(b) + P(\bar{b}) = 1$, it follows that

$$P(Y) = P(Y \mid b) = P(Y \mid \bar{b}) \tag{20}$$

which means that this observer's objective is achieved whenever the
probability of a "yes" response is the same when either condition b or
$\bar{b}$ occurs, that is, he leaves the choice to chance and achieves
his objective. The negative diagonal in figure 3A, (0,1) to (1,0), is
called the "chance line." The probability of error for this observer is

$$P(E) = P(N, b) + P(Y, \bar{b}) = P(N \mid b)\, P(b) + P(Y \mid \bar{b})\, P(\bar{b})$$

as before. Applying equation 20 and the usual identities and
rearranging terms, we have

$$P(E) = P(b)\,(1 - 2P(Y)) + P(Y) \tag{21}$$

from which we see that:

1. If the state b is certain, $P(b) = 1$, then the probability of
error, $P(E) = (1 - P(Y)) = P(N)$.

2. Alternately, if $\bar{b}$ is certain, $P(b) = 0$, then $P(E) = P(Y)$.

3. Most importantly, if $P(Y) = 0.5$ (if the response "yes" is
determined by the flip of a fair coin, $P(Y) = P(N) = 0.5$), then this
observer's error rate is independent of the prior probabilities, $P(b)$
or $P(\bar{b})$, as in equation 21; furthermore, all entries in the 2x2
matrix (table 1B) of conditional probabilities are equal,

$$P(Y \mid b) = P(Y \mid \bar{b}) = P(N \mid b) = P(N \mid \bar{b}) = 0.5,$$

and the observer is operating at the (0.5,0.5) point on the ROC chance
line. At this point, (0.5,0.5) in figure 3A, the operating curve
perpendicularly intersects the line of equal errors and the maximum
probable error is a minimum; note that this is the point on the chance
operating curve that lies closest to the ROC origin, the origin being
the point of zero error. Furthermore, if the observer biases his coin so
that the probabilities of "yes" and "no" are unequal, then the operating
point (the 2x2 matrix of conditional probabilities) moves away from
(0.5,0.5) along the chance line resulting in an increase in the maximum
probable error rate.

Figure 3—Receiver operating characteristic (ROC) example plot.
(Abscissa: false alarm rate, $P(Y \mid \bar{b})$.)

A decision system's performance is judged by the location of its
operating curve. Good performance is measured in the space below the
chance line by how close the operating curve approaches the origin,
indicating low error rates. It is possible for a decision system to
operate in the space above the chance line by using bad decision rules
or bad observational data. "Bad information is worse than no
information." The manager-observer should learn to recognize such cases,
where he is better off flipping a fair coin than using bad information.
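
The behavior of the chance-line observer can be verified with a short calculation. The Python sketch below evaluates the error probability of an observer whose "yes" response is independent of the fire state, both directly from the joint probabilities and from the rearranged form of equation 21; the function names are hypothetical. Because the error is linear in the prior $P(b)$, its maximum over all priors is reached at $P(b) = 0$ or $P(b) = 1$, which is what `worst_case_error` uses.

```python
def chance_error(p_b, p_yes):
    """P(E) for an observer whose "yes" is independent of the fire state."""
    # On the chance line P(Y|b) = P(Y|not-b) = P(Y), so P(N|b) = 1 - P(Y).
    miss_part = (1.0 - p_yes) * p_b          # P(N|b) P(b)
    false_alarm_part = p_yes * (1.0 - p_b)   # P(Y|not-b) P(not-b)
    return miss_part + false_alarm_part

def chance_error_eq21(p_b, p_yes):
    """The same quantity in the rearranged form of equation 21."""
    return p_b * (1.0 - 2.0 * p_yes) + p_yes

def worst_case_error(p_yes):
    """Maximum of P(E) over all priors; attained at P(b) = 0 or P(b) = 1."""
    return max(chance_error(0.0, p_yes), chance_error(1.0, p_yes))

for p_yes in (0.3, 0.5, 0.7):
    print(p_yes, chance_error(0.2, p_yes), chance_error(0.9, p_yes),
          worst_case_error(p_yes))
```

With a fair coin the error is 0.5 for any prior, and a biased coin raises the worst-case error above 0.5, in line with the three numbered observations.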

The  Minimax  Observer 

Recalling that "good" operating curves occupy the space below the
chance line, and are thus convex downward, where either error rate
should be monotonically decreasing as the other increases as in the
example (figure 3B), we can suggest a decision rule that minimizes the
maximum probable error rate, independent of any knowledge of prior
probabilities. By geometric inspection of figure 3B, one may determine
the point of closest approach of an operating curve (analytic or
otherwise) to the origin. The shortest line segment from the origin to a
point on the operating curve is perpendicular to the tangent, of slope
$-L(n_0)$, at that point. Even for imperfect and empirically derived
operating curves that point will be more or less close to the
intersection with the line of equal error rates.

Our goal, again, is to minimize the probability of error. Setting the
derivative of $P(E)$ with respect to the false alarm rate,
$P(Y \mid \bar{b})$, equal to zero gives a minimum and not a maximum
because $-L(n)$ is an increasing function of the abscissa,
$P(Y \mid \bar{b})$ (equation 3):

$$P(E) = P(Y \mid \bar{b})\, P(\bar{b}) + P(N \mid b)\, P(b)$$

$$\frac{dP(E)}{dP(Y \mid \bar{b})} = 0 = P(\bar{b}) + \frac{dP(N \mid b)}{dP(Y \mid \bar{b})}\, P(b)$$

then again, by equation 19, we have

$$\frac{dP(N \mid b)}{dP(Y \mid \bar{b})} = -L(n_0) = -\frac{P(\bar{b})}{P(b)}.$$

Thus, $L(n_0) = P(\bar{b})/P(b)$, as before, but here we have
demonstrated that the point (false alarm rate, miss rate) at which the
slope of the operating curve is $-L(n_0)$ is the operating point where
the maximum probable error is a minimum; that is, the maximum probable
error over all n but a minimum over all possible $P(Y \mid \bar{b})$ and
similarly for $P(N \mid b)$.

The ROC is a known characteristic of the observation/prediction
model. To operate with minimum error rate, the observer should use the
minimax decision rule: if $L(n) > L(n_0)$, say "yes," otherwise say
"no," where $L(n)$ is the result of observation as before and where
$-L(n_0)$ is equal to the slope of the ROC curve at the point of closest
approach to the origin; for the minimax observer $L(n_0)$ tends to be
more or less close to the line of equal errors (where $L(n_0) = 1$),
depending on the precise shape of the ROC at the point of minimum error
(note that this observer's goal is to minimize the error rate and not
the total error as the maxP(C) observer above).
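
The point of closest approach to the ROC origin can be located by a simple grid search over the cutoff. The sketch below assumes, purely for illustration, normal distributions for $P(n \mid b)$ and $P(n \mid \bar{b})$ with hypothetical means and a common standard deviation; the names `operating_point` and `closest_approach` are likewise hypothetical.

```python
import math

# Hypothetical distributions: n | b ~ Normal(7, 2), n | not-b ~ Normal(3, 2).
MU_B, MU_NOT_B, SIGMA = 7.0, 3.0, 2.0

def normal_cdf(x, mu, sigma):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def operating_point(n0):
    """Return (false alarm rate, miss rate) for the cutoff n0."""
    false_alarm = 1.0 - normal_cdf(n0, MU_NOT_B, SIGMA)
    miss = normal_cdf(n0, MU_B, SIGMA)
    return false_alarm, miss

def closest_approach(lo=0.0, hi=14.0, steps=1400):
    """Grid search for the cutoff whose operating point is nearest (0, 0)."""
    best_n0, best_dist = lo, float("inf")
    for i in range(steps + 1):
        n0 = lo + (hi - lo) * i / steps
        false_alarm, miss = operating_point(n0)
        dist = math.hypot(false_alarm, miss)
        if dist < best_dist:
            best_n0, best_dist = n0, dist
    return best_n0, best_dist

n0_star, distance = closest_approach()
print(n0_star, operating_point(n0_star), distance)
```

For this symmetric assumed pair of distributions the closest point falls where the two error rates are equal, so it lies on the line of equal errors; a less symmetric pair would place it only near that line.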

An irrational observer could include in his decision rule a few
values of n for which corresponding values of $L(n)$ are less than some
of those not included in his rule; so, he responds Y to those
observations of n that represent less evidence for b than other good
observations to which he responds N. Such an observer is not a
likelihood ratio observer! Inescapably, he is still working within the
ROC plot, somewhere in the space above the system's ROC curve. It must
be emphasized that, for any given false alarm rate, the miss rate cannot
be made smaller than that for the likelihood ratio observer. (Similarly,
for any given miss rate, the false alarm rate cannot be smaller.) Note,
however, that the error rates may all vary with the decision rule at the
option of the observer.
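
A small discrete example makes the comparison concrete. In the sketch below the observation is binned into five hypothetical classes with made-up conditional probabilities (they are not taken from the paper), and every possible "yes" set is enumerated by brute force. The rule built from the bins with the largest likelihood ratios is compared with an "irrational" rule that has the same false alarm rate.

```python
from fractions import Fraction
from itertools import combinations

# Hypothetical discrete distributions over five observation bins n = 0..4.
P_N_GIVEN_B = [Fraction(x, 100) for x in (5, 10, 15, 30, 40)]
P_N_GIVEN_NOT_B = [Fraction(x, 100) for x in (40, 30, 15, 10, 5)]

def error_rates(yes_bins):
    """Return (false alarm rate, miss rate) for a rule saying "yes" on yes_bins."""
    false_alarm = sum(P_N_GIVEN_NOT_B[n] for n in yes_bins)
    miss = 1 - sum(P_N_GIVEN_B[n] for n in yes_bins)
    return false_alarm, miss

def best_rule(max_false_alarm):
    """Brute-force the "yes" set with the lowest miss rate within the limit."""
    bins = range(len(P_N_GIVEN_B))
    candidates = [set(c) for r in range(len(bins) + 1)
                  for c in combinations(bins, r)]
    allowed = [c for c in candidates if error_rates(c)[0] <= max_false_alarm]
    return min(allowed, key=lambda c: error_rates(c)[1])

likelihood_ratio_rule = {3, 4}   # the two bins with the largest L(n)
irrational_rule = {2}            # same false alarm rate, weaker evidence

print(error_rates(likelihood_ratio_rule))
print(error_rates(irrational_rule))
print(best_rule(Fraction(15, 100)))
```

Both rules have a false alarm rate of 3/20, but the likelihood ratio rule misses 3/10 of the burns while the irrational rule misses 17/20, and the brute-force search returns the likelihood ratio rule.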

A city fire department chooses a very low fire detection threshold to
reduce errors of the first kind (missed detections) and hence must
tolerate many false alarms; alternatively, a forester may pass up
several opportunities (accepting the misses) because he cannot risk a
bad burn. In both instances, the managers are not acting as minimax
observers to minimize total error, but must consider the values and
prior probabilities as in the previous examples. Both are considered
good likelihood ratio observers if they choose an operating point
($L(n_0)$) on their ROC curve.

Prescription  Windows 

While developing error equations 17 and 18, we assumed a Y/N cutoff
in terms of $n_0$ (not $L(n_0)$) because the arithmetic was easier;
$L(n)$ is an equivalent discriminator because it is always a piecewise
monotone function of n. $L(n)$ is the more practical in many
applications; for example, the likelihood ratio for
$b = (b_1, \bar{b}_2)$ in the example is calculated, following
equation 13:

$$L(n) = \frac{P(n \mid b)}{P(n \mid \bar{b})},$$

as shown in figure 4A. The decision threshold for this window is
specified by a single value of the odds, $L(n_0)$, whereas to accomplish
the same purpose, two very complicated critical values of n would be
required: a cuton value, $n_{0,1}$, near the $b_1$ limit, and a cutoff
value, $n_{0,2}$, near the $\bar{b}_2$ limit.
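
To illustrate how one likelihood threshold stands in for two critical values of n, the sketch below uses an entirely hypothetical pair of distributions: a narrow normal for observations inside the window, and a two-component normal mixture for observations outside it (fires that go out at low n and fires that burn hotter at high n). None of these parameters come from the paper. A grid search then reports the lowest and highest n at which $L(n)$ exceeds the threshold.

```python
import math

def normal_pdf(x, mu, sigma):
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def p_n_given_window(n):
    """Hypothetical P(n|b) for a prescription window centered near n = 5."""
    return normal_pdf(n, 5.0, 1.0)

def p_n_given_not_window(n):
    """Hypothetical P(n|not-b): cooler states near n = 2, hotter near n = 9."""
    return 0.5 * normal_pdf(n, 2.0, 1.5) + 0.5 * normal_pdf(n, 9.0, 1.5)

def likelihood_ratio(n):
    return p_n_given_window(n) / p_n_given_not_window(n)

def window_limits(threshold, lo=0.0, hi=14.0, steps=1400):
    """Return (cuton, cutoff): lowest and highest grid n with L(n) > threshold."""
    grid = [lo + (hi - lo) * i / steps for i in range(steps + 1)]
    accepted = [n for n in grid if likelihood_ratio(n) > threshold]
    if not accepted:
        return None
    return accepted[0], accepted[-1]

print(window_limits(2.0))
print(window_limits(3.0))
```

With these assumed shapes $L(n)$ rises and then falls, so a single threshold on $L(n)$ corresponds to an interval of n, and raising the threshold narrows that interval from both ends.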

In figure 4B, the likelihood ratios are plotted also for the burning
windows $(b_1, \bar{b}_3)$ and $(b_2, \bar{b}_3)$, respectively, the
fractions of fireline aflame between 0.01 and 0.99 and between 0.5 and
0.99. Note that the manager might improve the likelihood (odds) of
hitting his burning window (by observing a value of n) if the window is
wider as in the former case, or the window is moved to better burning
conditions as in the latter case. Similarly, if he knows that his prior
odds are, say, 3:1 against the prescription $(b_1, \bar{b}_2)$, that is,
$L(n_0) = 3$, then he cannot rationally expect to hit that prescription;
whereas, by widening the window (as in $(b_1, \bar{b}_3)$) two effects
improve his chances: first, $L(n)$ will improve as in figure 4B, and
secondly, the prior odds against $(b_1, \bar{b}_3)$ are expected to be
smaller, that is, $L(n_0) < 3$, because, again, the historical record
will show more fires in $(b_1, \bar{b}_3)$ than in $(b_1, \bar{b}_2)$.

Figure 4—Likelihood ratios for conjunctions of fire states
(prescription windows). (Abscissa of both panels: extinction index, n.
Panel A shows the window $(b_1, \bar{b}_2)$ with $L(n_0) = 2.0$; panel B
shows $(b_1, \bar{b}_3)$, $(b_2, \bar{b}_3)$, and $(b_1, \bar{b}_2)$
with $L(n_0) = 3.0$.)

Suppose, now, a manager has an economically important fire treatment
to perform; the fire prescription is $(b_1, \bar{b}_2)$ as above with
the same prior odds, $P(\bar{b})/P(b) = 3$. As an expected value
observer, the manager has considered all of the relevant treatment costs
and benefits to construct a payoff matrix with the following components:

•  If the burn is successful, the net present value would be
$V_{Y,b}$ = $30 per acre. In this simple example, net value is composed
of a depreciated future benefit of $50/acre, less $10/acre fixed
planning and preparation costs and $10/acre for labor and supplies.

•  The missed opportunity value, $V_{N,b}$ = -$15/acre, includes the
$10/acre fixed cost and $5/acre associated with the lost opportunity and
delayed burning schedule.

•  The cost to try and fail, $V_{Y,\bar{b}}$ = -$25/acre, is composed of
$10/acre fixed cost, $10/acre labor and supply, and $5/acre loss to
present net value due to site damage.

•  And, the value of correctly deciding not to burn, $V_{N,\bar{b}}$ =
-$10/acre, is just the planning fixed cost.

Then the decision threshold, $L(n_0)$, becomes (by equation 16),

$$L(n_0) = \frac{P(\bar{b})}{P(b)} \cdot \frac{V_{N,\bar{b}} - V_{Y,\bar{b}}}{V_{Y,b} - V_{N,b}} = 3 \cdot \frac{(-10) - (-25)}{(30) - (-15)} = 3 \cdot \frac{1}{3} = 1.0.$$

Thus, the effect is to lower the decision threshold, $L(n_0)$, which
is readily justified by the expected payoff. Note that the "fixed
planning cost" has no resultant effect on the decision function,
basically because that cost was incurred (by some previous decision)
before we arrived at this decision point and is an obligation regardless
of the decision. Also note that generally (not always), the two errors
will have less value than their respective correct conjunctions, thus
both numerator and denominator of the value ratio (eq. 16) will have
positive values. (An exercise for the reader is to consider the dilemma
of the manager who "must decide between the lesser of two evils.") In
many practical examples, $V_{Y,b}$ will be the dominant positive value;
hence the value ratio will have a positive value, less than 1, so the
payoff justifies a decision that might otherwise be against the prior
odds.
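
The arithmetic of this example, and the remark that the fixed planning cost drops out, can be checked directly. The Python sketch below uses the per-acre values from the payoff matrix above; the function name `ev_threshold` and its argument names are hypothetical.

```python
from fractions import Fraction

def ev_threshold(prior_odds_against, v_yb, v_nb, v_ynb, v_nnb):
    """Equation 16: prior odds against b times the value ratio."""
    value_ratio = Fraction(v_nnb - v_ynb, v_yb - v_nb)
    return prior_odds_against * value_ratio

# Values from the example, in dollars per acre.
with_fixed_cost = ev_threshold(3, v_yb=30, v_nb=-15, v_ynb=-25, v_nnb=-10)

# The same matrix with the $10/acre fixed planning cost removed from every cell.
without_fixed_cost = ev_threshold(3, v_yb=40, v_nb=-5, v_ynb=-15, v_nnb=0)

print(with_fixed_cost, without_fixed_cost)
```

Both calls give a threshold of 1, because a cost common to all four conjunctions cancels in each difference of the value ratio.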

Of course the cost and benefit values in the value ratio are
associated with the specific prescription, $b = (b_1, \bar{b}_2)$, for
example. The manager may consider the tradeoffs of adjusting the
prescription window and the corresponding value ratios to arrive at the
best possible decision.

If we vary the decision rule to successively enlarge the range of
acceptable n, we will start with a large $L(n_0)$ (fig. 4) at the ROC
point (0,1), where the acceptable range is zero and (by any rational
decision rule) we will always say "no." There we will never have a false
alarm, the miss rate equals 1.0, and the (joint) probability of a missed
opportunity is equal to the (prior) probability of the opportunity. We
will end with a small $L(n_0)$ at the ROC point (1,0), where we always
say "yes" and never miss an opportunity, but the false alarm rate equals
1.0 and the (joint) probability of a false alarm is equal to the (prior)
probability that the opportunity does not occur (refer to equations 2 to
7). Furthermore, the range of acceptable n must be nondecreasing over
the entire path because the ROC slope, $-L(n)$, varies so.




Model  Performance 

An ROC curve is uniquely determined for each pair of distributions,
$P(n \mid b)$, $P(n \mid \bar{b})$, and is the same for all decision
rule criteria. There exists, then, a family of ROC curves that depend on
the parameter, b; which family is represented by the fire states, $b_1$,
$b_2$, $b_3$.

The ROC plot for Wilson's model is shown in figure 5A. The continuous
curves are generated by equations 17 and 18, and the respective curves
in figure 1. The data points are the appropriate experimental counts of
burns, b, and not-burns, $\bar{b}$, in sequential increments of the
parameter n of equation 1. The extinction index, n, of equation 1 is the
independent variable of the parametric ROC plot for each member of the
family, $b_i$ (i = 1,2,3). Each curve begins with n = 0 near the ROC
point (1,0) and ends near (0,1) with n > 14.

Neglecting the parameter, n, there is no significant difference in
the ROC family of curves. The utility of the model is demonstrated by
the low values of the points of minimum error rate: (0.12,0.12),
(0.13,0.13), and (0.15,0.15), respectively for $b_1$, $b_2$, and $b_3$.
But these points occur when the parameter, n, equals 4.25, 4.62, and
5.75, respectively, demonstrating that each fire state will have its own
individual ROC.
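
Points such as those quoted above lie where the miss rate equals the false alarm rate. As a hedged illustration of how such a point might be located for a given pair of distributions, the sketch below applies bisection to a hypothetical pair of normal distributions with different spreads; these parameters are assumptions and do not reproduce the experimental values in the text.

```python
import math

# Hypothetical distributions: n | b ~ Normal(7, 2.5), n | not-b ~ Normal(3, 1.5).
MU_B, SIGMA_B = 7.0, 2.5
MU_NOT_B, SIGMA_NOT_B = 3.0, 1.5

def normal_cdf(x, mu, sigma):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def miss_rate(n0):
    return normal_cdf(n0, MU_B, SIGMA_B)

def false_alarm_rate(n0):
    return 1.0 - normal_cdf(n0, MU_NOT_B, SIGMA_NOT_B)

def equal_error_cutoff(lo=0.0, hi=14.0, tol=1e-9):
    """Bisection for the cutoff n0 at which the two error rates are equal."""
    # The miss rate rises and the false alarm rate falls as n0 increases,
    # so their difference changes sign exactly once on the interval.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if miss_rate(mid) < false_alarm_rate(mid):
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

n0_equal = equal_error_cutoff()
print(n0_equal, miss_rate(n0_equal), false_alarm_rate(n0_equal))
```

Changing the assumed means or spreads moves both the cutoff and the common error rate, which mirrors the observation that each fire state has its own operating curve and its own cutoff.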

The data points for $b_1$ at low n in figure 5A depart significantly
from the ideal. This variance has roots in the physical observation
process. The large miss error is because several fires burned that
should not have been expected to burn; the low n indicates poor burning
conditions, the state $b_1$ being fires that did not go out. By equation
1 these were fuel beds with light loading and/or high moisture content.
It was noted during the experiment that a few of these fires could be
sustained by two or three individual fuel sticks that may have had
anomalously high heat of combustion (pitch and resins) or low moisture.
In other fires with very light loading, the intersection of three or
more sticks constituted a "jackpot" with significantly hotter fire
behavior. The spatial distribution of these jackpots could not be
controlled, even with extreme experimental precaution, so some fires
were sustained by jackpot-to-jackpot propagation that would have been
predicted to go out if the fuel bed had been perfectly homogeneous.
These inhomogeneities are insignificant in heavier loaded fuel beds
(larger n) and do not affect our expectations for the hotter burning
states. Thus there are two points to make:

1. Inhomogeneity is a physical variable that is not "recognized" in
the fire model, is not incorporated in the likelihood ratios, nor in the
ROC's, etc. It is, however, a parameter whose statistics could vary from
one locality to another and so affect the prior odds on which the
manager bases his decision.

2. These ROC curves indicate an ideal limit of performance; they
represent the best that you can expect to do with the information
observed and, in some cases, even that might be difficult to achieve.


Figure 5—(A) ROC for the marginal fire states; (B) plot of posterior
experimental errors, where $P(Y, \bar{b}) = P(Y \mid \bar{b})\, P(\bar{b})$
and $P(N, b) = P(N \mid b)\, P(b)$. (Abscissa of panel A: false alarm
rate, $P(Y \mid \bar{b})$; abscissa of panel B: false alarm probability,
$P(Y, \bar{b})$.)


In  figure  5B  the  error  rates  are  multiplied  by  their 
respective  prior  probabilities  of  the  experiment,  thus 
giving  a  plot  of  the  expected  errors  for  the  experiment, 
namely,  the  joint  probabilities.  This  plot  is  typical  of  what 
might  be  expected  in  a  specific  application,  but  of  course  it 
will  vary  by  application  because  the  prior  probability 
distribution  will  vary  substantially. 
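
The conversion from error rates to expected (joint) errors is a single multiplication by the prior probabilities. In the sketch below the error rates (0.12, 0.12) are the minimum-error point quoted earlier for the first fire state, while the prior $P(b) = 0.25$ and the function name `joint_errors` are hypothetical choices made only for illustration.

```python
def joint_errors(false_alarm_rate, miss_rate, p_b):
    """Scale the conditional error rates by the priors to get joint errors."""
    p_not_b = 1.0 - p_b
    p_false_alarm = false_alarm_rate * p_not_b   # P(Y, not-b)
    p_miss = miss_rate * p_b                     # P(N, b)
    return p_false_alarm, p_miss, p_false_alarm + p_miss

# Error rates from the text; the prior P(b) = 0.25 is an assumed value.
p_false_alarm, p_miss, p_error = joint_errors(0.12, 0.12, p_b=0.25)
print(p_false_alarm, p_miss, p_error)
```

When the two error rates are equal, the total error equals that common rate for any prior, but the split between false alarms and misses shifts with the prior, which is why the plot differs from one application to another.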

As we have noted, the distributions of marginal states, b, are
bounded at the upper limit when 100 percent of the fireline is aflame.
We have started a cursory examination of fire states defined in
alternative terms of fire intensity without an upper bound. In
particular, flame length is also a measure of intensity that shows
promise of describing and discriminating between the hotter fire states.
Preliminary analysis (of the 120+ "hot" fires > $b_3$) indicates a
family of ROC curves that begin very close to those of figure 5A and
then successively approach closer to the ROC origin as flame lengths
lengthen.

CONCLUSIONS 

The utility of a fire model in the decision process is greatly
enhanced when a probable distribution of likely events is available to
the decision maker. These distributions allow him to weigh the
alternatives and risks involved, that is, to make good decisions. When a
deterministic fire model gives a single value prediction, the fire
manager is left wanting, with only his subjective experience as guide.
How close to the true value is the prediction? What is the risk that it
is wrong? Is the prediction appropriate to the local situation?

A fire model cannot be expected to respond to variations other than
those observed and measured as inputs to the model. For a model to be
widely useful, it should express the likelihood (as in equation 13 or in
$P(n \mid b)$, etc.) that the currently measured physical conditions
would be observed for the fixed fire state in question (or some
equivalent expression). The model will, then, perform optimally, without
built-in development biases and with a provision for differences between
various localities (and times). A large portion of forestry literature
is composed of case studies, isolated anecdotal data, and reports of
previously unobserved (rare) events. The task of compiling data bases
for use of "expert systems" by decision makers must be done with such
extreme care that the likelihood of any event, observation, or condition
is not unreasonably overstated. Furthermore, isolated instances
represent a single data point and not a distribution; hence, they are of
little value to the person who must take risks when one does not know
where in the distribution that data point might lie!

The  next  task  is  to  develop  a  system  for  calculating 
ranges  of  fire  behavior  that  fire  managers  will  find  useful 
and  easily  understood. 


REFERENCES 

Egan,  James  P.  1975.  Signal  detection  theory  and  ROC 
analysis.  New  York:  Academic  Press.  277  p. 

Hahn,  G.  J.;  Shapiro,  S.  S.  1967.  Statistical  models  in 
engineering.  New  York:  Wiley  and  Sons.  355  p. 

Larson,  Harold  J.  1969.  Introduction  to  probability  theory 
and  statistical  inference.  New  York:  John  Wiley  and 
Sons.  387  p.  (Ch.  9). 

Rothermel, Richard C. 1972. A mathematical model for
predicting fire spread in wildland fuels. Res. Pap.
INT-115. Ogden, UT: U.S. Department of Agriculture,
Forest Service, Intermountain Forest and Range Experiment
Station. 40 p.

Savage,  Leonard  J.  1954.  Foundations  of  statistics.  New 
York:  John  Wiley  and  Sons.  294  p. 

Slovic, Paul; Lichtenstein, Sarah. 1971. Comparison of
Bayesian and regression approaches to the study of
information processing and judgment. Organizational
Behavior and Human Performance. 6: 649-744.

Wilson,  Ralph  A.,  Jr.  1985.  Observations  of  extinction  and 
marginal  burning  states  in  free  burning  porous  fuel 
beds.  Combustion  Science  and  Technology.  44:  179-194. 




Wilson,  Ralph  A.,  Jr.  1987.  A  theoretical  basis  for  modeling  probability  distributions  of 
fire  behavior.  Res.  Pap.  INT-382.  Ogden,  UT:  U.S.  Department  of  Agriculture, 
Forest  Service,  Intermountain  Research  Station.  11  p. 

A  method  is  proposed  for  predicting  a  probabilistic  distribution  of  fire  behavior.  A 
fire  model  gives  conditional  distributions  of  the  physical  fire  site  parameters  one  would 
observe  for  given  fire  states.  The  probabilistic  prediction  is  provided  by  Bayes  formula 
and  knowledge  of  the  prior  distribution  of  fire  states  at  specific  sites.  Thus,  the  fire 
model  is  independent  of  the  site  location.  The  techniques  of  plotting  Receiver 
Operating  Characteristics  are  demonstrated  with  examples  to  optimize  and  test  the 
fire  model's  performance. 


KEYWORDS:  decision  theory,  marginal  burning 


INTERMOUNTAIN  RESEARCH  STATION 


The Intermountain Research Station provides scientific knowledge
and technology to improve management, protection, and use of the
forests and rangelands of the Intermountain West. Research is
designed to meet the needs of National Forest managers, Federal and
State agencies, industry, academic institutions, public and private
organizations, and individuals. Results of research are made available
through publications, symposia, workshops, training sessions, and
personal contacts.

The  Intermountain  Research  Station  territory  includes  Montana, 
Idaho,  Utah,  Nevada,  and  western  Wyoming.  Eighty-five  percent  of 
the  lands  in  the  Station  area,  about  231  million  acres,  are  classified  as 
forest  or  rangeland.  They  include  grasslands,  deserts,  shrublands, 
alpine  areas,  and  forests.  They  provide  fiber  for  forest  industries, 
minerals  and  fossil  fuels  for  energy  and  industrial  development,  water 
for  domestic  and  industrial  consumption,  forage  for  livestock  and 
wildlife,  and  recreation  opportunities  for  millions  of  visitors. 

Several  Station  units  conduct  research  in  additional  western 
States,  or  have  missions  that  are  national  or  international  in  scope. 

Station  laboratories  are  located  in: 

Boise,  Idaho 

Bozeman,  Montana  (in  cooperation  with  Montana  State  University) 
Logan,  Utah  (in  cooperation  with  Utah  State  University) 
Missoula,  Montana  (in  cooperation  with  the  University  of  Montana) 
Moscow,  Idaho  (in  cooperation  with  the  University  of  Idaho) 
Ogden,  Utah 

Provo, Utah (in cooperation with Brigham Young University)
Reno,  Nevada  (in  cooperation  with  the  University  of  Nevada) 


USDA policy prohibits discrimination because of race, color,
national origin, sex, age, religion, or handicapping condition. Any
person who believes he or she has been discriminated against in any
USDA-related activity should immediately contact the Secretary of
Agriculture, Washington, DC 20250.


